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Abstract 

In this paper, we prove that a set of N points in R 2 has at least c lo ^ N distinct 
distances, thus obtaining the sharp exponent in a problem of Erdos. We follow the 
set-up of Elekcs and Sharir which, in the spirit of the Erlangen program, allows us to 
study the problem in the group of rigid motions of the plane. This converts the problem 
to one of point-line incidences in space. We introduce two new ideas in our proof. In 
order to control points where many lines are incident, we create a cell decompostion 
using the polynomial ham sandwich theorem. This creates a dichotomy: either most 
of the points are in the interiors of the cells, in which case we immediately get sharp 
results, or alternatively the points lie on the walls of the cells, in which case they 
are in the zero set of a polynomial of suprisingly low degree, and we may apply the 
algebraic method. In order to control points where only two lines are incident, we use 
the flecnodc polynomial of the Rev. George Salmon to conclude that most of the lines 
lie on a ruled surface. Then we use the geometry of ruled surfaces to complete the 
proof. 



1 Introduction 

In [E], Paul Erdos posed the question: how few distinct distances are determined by N 
points in the plane. Erdos checked that if the points are arranged in a square grid, then 
the number of distinct distances is ~ N . He conjectured that for any arrangement of 

N points, the number of distinct distances is > y/iogN ' (Throughout this paper, we use the 
notation A > B to mean that there is a universal constant C > with A > CB.) 

In the present paper, we prove 

Theorem 1.1. A set of N points in the plane determines > \ Q gN distinct distances. 

Various authors have proved lower bounds for the number of distinct distances. These 
include but are not limited to [M]> |CSzT| . [Sz] . |SoTo] . [T] . The most recent lower bound, 
in |KTj . says that the number of distances is > jy 8641 . p or a more thorough presentation 
of the history of the subject see the forthcoming book [GIS]. 



In [ES| . Elekes and Sharir introduced a completely new approach to the distinct distance 
problem, which uses the symmetries of the problem in a novel way. They laid out a 
plan to prove Theorem II. 1\ which we follow in this paper. Their approach connects the 
distinct distance problem to three-dimensional incidence geometry. Using their arguments, 
Theorem 1.1 follows from the following estimate about the incidences of lines in M 3 . 
Theorem 1.2. Let £ be a set of N 2 lines in R 3 . Suppose that £ contains < N lines in 
any plane or any regulus. Suppose that 2 < k < N. Then the number of points that lie in 
at least k lines is < iV 3 /c~ 2 . 

Recently, there has been a lot of progress in incidence geometry coming from the polyno- 
mial method. In [D], Dvir used the polynomial method to prove the finite field Kakeya 
conjecture, which can be considered as a problem in incidence geometry over finite fields. 
In [GK] . the polynomial method was applied to incidence geometry problems in M 3 , solv- 
ing the joints problem. The method was simplified and generalized in [EKS] . [KSSj . and 
[Ql - Kaplan, Sharir, and Shustin ( [KSS] ) and Quilodran (|Q|) solved the joints problem in 
higher dimensions. For context, we mention here the joints theorem in n dimensions. 
Theorem 1.3. fjKSS] . JQ$) Let n > 3. Let £ be a set of L lines in R n . A joint of £ is 
a point that lies in n lines of £ with linearly independent directions. The number of joints 
of £ is < C n L^ . 

One of the remarkable things about the polynomial method is how short the proofs are. 
The finite field Kakeya problem and the joints problem were considered to be very difficult, 
and many ideas were tried in both cases. The proof of the finite field Kakeya result (|DJ) 
and the simplified proof of the joints theorem ( [KSS] or [Q]) are each about one page long. 
This simplicity gives the feeling that these are the "right" proofs for these theorems. 

In [EKSJ, Elekes, Kaplan, and Sharir used the polynomial method to prove the case k = 3 
of Theorem ll.2[ (It is a special case of Theorem 9 of that paper.) It remains to prove 
Theorem 11.21 when k = 2 and when k is large. This requires two new ideas. When k = 2, 
the key new idea is an application of ruled surfaces. When k is large, the key new idea is 
an application of a ham sandwich theorem from topology. Let's discuss these new ideas in 
more detail. 

First we explain the extra difficulty that occurs when k = 2, as opposed to k = 3. The 
fundamental idea of the polynomial method is to find a polynomial p of controlled degree 
whose zero set Z contains the set of lines £. Then one uses the geometry of Z to study £. 
A point where three lines of £ intersect is an unusual point of the surface Z: it is either 
a critical point of Z or else a 'flat' point of Z. One can use algebraic geometry to control 
the critical points and flat points of Z in terms of the degree of p. But a point of Z where 
two lines of £ intersect does not have to be either critical or flat, and we don't know of any 
special property of such a point. 

Reguli play an important role in the case k = 2. If 1%, I2, and I3 are three pairwise-skew 
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lines in R 3 , there is a 1-parameter family of lines that intersects all three lines. The union 
of the lines in the 1-parameter family is a surface called a regulus. A regulus is a degree 
2 algebraic surface. An example is the surface defined by the equation z = xy. A set of 
N 2 lines in a regulus can have ~ iV 4 points of intersection. A regulus is an example of a 
ruled surface. In this paper, a ruled surface means an algebraic surface which contains a 
line through each point. 

We apply the theory of ruled surfaces to prove our estimate in the case k = 2. We 
first observe that if a surface Z of controlled degree contains too many lines, then some 
component of the surface Z must be ruled. In this way, we can reduce to the case of a 
set of lines contained in a ruled surface of controlled degree. Ruled surfaces have some 
special structure, and we use that structure to bound the intersections between the lines. 
A ruled surface is called singly-ruled if a generic point in the surface lies in only one line 
in the surface. It is well known that planes and reguli are not singly-ruled, but every other 
irreducible ruled surface is. The reason is that if a surface is not singly-ruled, it is easy to 
find three lines and I3 in the surface which meet infinitely many lines not at one of 

the possibly three points of intersection of li, 1%, and I3. This implies that the surface has 
a factor which is a plane, if any two of h,l2, and £3 is coplanar and a regulus if they are 
pairwise skew. A point where two lines intersect inside of a singly-ruled surface must be 
critical - except for points lying in the union of a controlled number of lines and a finite 
set of additional exceptions. By using this type of result, the structure of ruled surfaces 
helps us to prove our estimate. 

Next we try to explain the extra difficulty that occurs for large k. An indication of the 
difficulty is that for large k, Theorem 11.21 does not hold over finite fields. (When k = 2 
or 3, it's an open question whether Theorem 11.21 holds over finite fields, but we suspect 
that it does.) The counterexample occurs when one considers £ to be all of the lines in 
F 3 . (Here, F denotes a finite field.) This situation is reminiscent of the situation for the 
Szemeredi- Trotter theorem. 

The Szemeredi- Trotter incidence theorem ( [SzTj ) is the most fundamental and important 
result in extremal incidence geometry. It was partly inspired by Erdos's distance problem, 
and it has played a role in all the recent work on the subject. 

Theorem 1.4. (Szemeredi- Trotter) Let £ be a set of L lines in R 2 . Then the number of 
points that lie in at least k lines is < C(L 2 k~ 5 + Lk^ 1 ). 

The Szemeredi- Trotter theorem is also false over finite fields: the counterexample occurs 
when one considers £ to be all of the lines in F 2 . All of the proofs of the theorem involve 
in some way the topology of R 2 . One approach, which is important in our paper, is the 
cellular method introduced in the seminal paper [CEGSW] by Clarkson, Edelsbrunner, 
Guibas, Sharir, and Welzl. The cellular method is a kind of divide-and-conquer argument. 
One carefully picks some lines, which divide the plane into cells, and then one studies £ 
inside of each cell. 
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The cellular method has been very successful for problems in the plane, but only partly 
successful in higher dimensions. For example, in |FS| . Feldman and Sharir attacked the 
(3-dimensional) joints problem using the cellular method (among other tools). They were 
able to prove that the number of joints determined by L lines is < L L62 . (For contrast, 
the algebraic method gives < L 3 / 2 .) 

It seems to us that there are strong analogies between Theorem 11.21 and the Szemeredi- 
Trotter theorem, and also between Theorem 11.21 and the joints theorem. As in the Sze- 
meredi- Trotter theorem, topology must play some role. As in the joints theorem, it is 
natural for polynomials to play some role. 

To prove Theorem 11.21 when k is large, we construct a cell decomposition where the walls of 
the cells form an algebraic surface Z defined by a polynomial p. The polynomial is found 
by a topological argument, using the polynomial version of the ham sandwich theorem. At 
this point, our argument involves a dichotomy. Let 6 denote the points that lie in at least 
k lines of £. In one extreme case, the points of (5 are evenly distributed among the open 
cells of our decomposition. In this case, we prove our estimate by the cellular method, 
similar to arguments from [CEGSWJ. In another extreme case, the points of 6 all lie in 
Z. In this case, it turns out that the lines of £ also lie in Z. In this case, we prove our 
estimate by the polynomial method, studying the critical and flat points of Z as in [GK| 
or pCS] . 

In Section 2, we explain the plan laid out by Elekes and Sharir. In particular, we explain 
how Theorem 11.11 follows from Theorem 1 1.21 In Section 3, we prove Theorem ll.2l in the case 
k = 2 using the ruled surfaces method. We begin with the necessary background on ruled 
surfaces. In Section 4, we prove Theorem 11.21 for k > 3 using the polynomial cell method. 
We begin with background on the polynomial ham sandwich theorem. In an appendix, we 
follow how our argument plays out when the set of points is a square grid. This example 
shows that several of our estimates are sharp up to constant factors, including Theorem 

o 
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2 Elekes-Sharir framework 



Elekes and Sharir |ES] developed a completely new approach to the distinct distance prob- 
lem, connecting it to incidence geometry in 3-dimensional space. In this section, we present 
(a small variation of) their work. 

Let P C R 2 be a set of N points. We let d(P) denote the set of non-zero distances among 
points of P. 

d(P) ■= {d(p,q)} p ^ P ^ q . 

To obtain a lower bound on the size of d(P), we will prove an upper bound on a set of 
quadruples. We let Q(P) be the set of quadruples, {pi,P2,P3,P<±) £ P 4 satisfying 



d(pi,Pa) = d(p3,P4)^0. (2.1) 

We refer to the elements of Q(P) as distance quadruples. If d(P) is small, then Q(P) needs 
to be large. By applying the Cauchy-Schwarz inequality, we easily obtain the following 
inequality. 

Lemma 2.1. For any set P C R 2 with N points, the following inequality holds. 



, .. N A - 27V 3 

Proof. Consider the distances in d(P), which we denote by d%,...,d m with m = \d(P)\. 
There are N 2 — N ordered pairs (j>i,Pj) 6 P 2 with pi ^ pj. Let ni be the number of these 
pairs at distance dj. So YliLi n i = N 2 — N. 

The cardinality |Q(-P)| is equal to Y^Li n i - by Cauchy-Schwarz, 



(m \ * / m \ 

Y>A ^ \ YyAm=\Q{P)\\d{P)\. 

Rearranging, we see that \d(P)\ > (N 2 - NflQiP)^ 1 . □ 

To prove Theorem II. 1\ it suffices to prove the following upper bound on |Q(P)|. 
Proposition 2.2. For any set P C R 2 of N points, the number of quadruples in Q{P) is 
bounded by \Q(P)\ < iV 3 logiV. 
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This Proposition is sharp up to constant factors when P is a square grid (see the ap- 
pendix). 

Elekes and Sharir study Q(P) from a novel point of view related to the symmetries of the 
plane. We let G denote the group of positively oriented rigid motions of the plane. The 
first connection between Q{P) and G comes from the following simple proposition. 
Proposition 2.3. Let (pi,P2,P3,P4) be a distance quadruple in Q(P). Then there is a 
unique g 6 G so that g{p\) = P3 and g(p%) = Pa- 

Proof. All positively oriented rigid motions taking p\ to p% can be obtained from the 
translation from p\ to p% by applying a rotation R about the point p%. Since d(p3,p^) = 
d(j>i,P2) > 0, there is a unique such rotation sending p2 + p 3 — pi into p%. □ 

Using Proposition 12.31 we get a map E from Q(P) to G, which associates to each distance 
quadruple (pi,P2,P3,P4) G Q(P) ; the unique g G G with g(pi) = p 3 and g(p2) = Pa- The 
letter E here stands for Elekes, who introduced this idea. 

Our goal is to use the map E to help us estimate |Q(P)| by counting appropriate rigid 
motions. It's important to note that the map E is not necessarily injective. The number of 
quadruples in E^ 1 (g) depends on the size of PCigP. We make this precise in the following 
lemma. 

Lemma 2.4. Suppose that g G G is a rigid motion and that \P n gP\ = k. Then the 
number of quadruples in E^ 1 (g) is 2( 2 ) . 

Proof. Suppose that PC\gP is {qi, qu}- Let pi = g~ 1 (qi). Since qi lies in gP, each point 
Pi lies in P. For any ordered pair (qi,qj) with qi ^ qj, the set (pi,Pj,qi,qj) is a distance 
quadruple. This assertion is easy to check. We have seen that Pi,Pj, qi, qj all lie in P. Since 
g preserves distances, d(pi,pj) = d(qi,qj). Since qi / qj, the distance d(qi,qj) ^ 0. 

Now we check that every distance quadruple in E^ 1 (g) is of this form. Let (pi,P2,P3,P4) 
be a distance quadruple in E^ 1 {g). We know that g{p\) = P3 and g{p2) = Pi- So P3,Pi lie 
in P n gP. Say p 3 = qi and p 4 = qj. Now p 1 = g^fa) = Pi and p 2 = g' 1 ^) =Pj. □ 

Let G =k (P) C G be the set of g £ G with |PngP| = k. Notice that G =N (P) is a subgroup 
of G. It is the group of orientation-preserving symmetries of the set P. For other k, G=k(P) 
is not a group, but these sets can still be regarded as sets of "partial symmetries" of P. 
Since P has elements, G=k(P) is empty for k > N. 

By Lemma l2~4l we can count |Q(P)| in terms of \G=k(P)\- 

\Q(P)\=J2^ 2 )\G=k{P)\. 

k=2 ^ ' 
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Let G k (P) C G be the set of g G G so that |P n gP\ > k. We see that \G =k (P)\ = 
\Gk(P)\ — \Gk+i(P)\- Plugging this into the last equation and rearranging, we get the 
following. 

N ... N 
\Q(P)\ = 2 (|Gfc(P)l " = E( 2& - 2 )\°k(P)l (2-2) 

k=2 ^ ' fc=2 

We will bound the number of partial symmetries as follows. 

Proposition 2.5. For any set P C R 2 of N points, and any 2 < k < N, the size ofGk(P) 
is bounded as follows 

\G k (P)\<N 3 k~ 2 . 

When P is a square grid, this estimate is sharp up to constant factors for all 2 < k < N (see 
the appendix). Plugging this bound into equation 12.21 we get \Q(P)\ ^5 -/V 3 log TV, proving 
Proposition 12.21 This in turn implies our main theorem, Theorem II. H So it suffices to 
prove Proposition 12.51 

Next Elekes and Sharir related the sets Gk(P) to an incidence problem involving certain 
curves in G. For any points p,q £ R 2 , define the set S pq C G given by 

S pq = {g G G : g{p) = q}. 

Each Spq is a smooth 1-dimensional curve in the 3-dimensional Lie group G. The sets 
Gk(P) are closely related to the curves S pq . 

Lemma 2.6. A rigid motion g lies in Gk(P) if and only if it lies in at least k of the curves 

{Spq}p,qeP- 

Proof. First suppose that g lies in Gfc(P). By definition, |P n gP\ > k. Let qi,...,qk be 
distinct points in P n gP. Let pi = g~ 1 (qi)- Since qi G gP, we see that pi lies in P. Since 
g{Pi) = 9i> we can sa y that g lies in S Piqi for i = 1, k. Since the qi are all distinct, these 
are k distinct curves. 

On the other hand, suppose that g lies in the curves S Piqi , S Pkqk , where we assume that 
the pairs (pi, qi), (pt,qk) are all distinct. We claim that gi, qk are distinct points. To 
see this, suppose that = qj. Since g is a bijection, we see that pi = g~ 1 {q{) = g^ 1 {qj) = 
Pj, and this gives a contradiction. But the points q%, g& all lie in P n gP. □ 

Bounding Gfc(P) is a problem of incidence geometry about the curves {S pq } P:q& p in the 
group G. By making a careful change of coordinates, we can reduce this problem to an 
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incidence problem for lines in R 3 . (Our change of coordinates is slightly nicer than the 
one in [ESj . In the coordinates of [ESj . the curves {S pq } become helices - a certain of class 
of degree 2 curves in R 3 .) 

Let G 1 denote the open subset of the orientable rigid motion group G given by rigid motions 
which are not translations. We can write G as a disjoint union G' U G trans , where G trans 
denotes the translations. We then divide Gk(P) = G' k U G k rans . Translations are a very 
special class of rigid motions, and it is fairly easy to bound | G k rans (P) | < N 3 k~ 2 . We carry 
out this minor step at the end of this Section. The main point is to bound \G' k (P)\. To do 
this, we pick a nice set of coordinates p : G' — s- R 3 . 

Each element of G' has a unique fixed point (x, y) and an angle of rotation about the 
fixed point with < 9 < 2ir. We define the map 



p : G' 



R d 



by 



P(x,y,0) = (x,y,cot -). 



Proposition 2.7. Let p = (p x ,Py) and q = {q x ,%) be points in R . Then with p as above, 

lpq 



the set p{S pq flG') is a line in R 3 . 



Proof. Noting that the fixed point of any transformation taking p to q must lie on the 
perpindicular bisector of p and q, the reader will easily verify that the set p(S pq n G') can 
be parametrized as 

( &+*,aL+* i0 ) +t (aLza,*z*i). (M) 



□ 



For any p,q £ R 2 , let L pq denote the line p(S pq n G'). The line L pq is parametrized by 
equation 12.31 Let £ be the set of lines {L pq } Ptq ^p. By examining the parametrization in 
equation 12.31 it's easy to check that these are A^ 2 distinct lines. If g lies in G' k (P), then 
p(g) lies in at least k lines of £. In the remainder of the paper, we will study the set of 
lines £ and estimate the number of points lying in k lines. 

We would like to prove that there are < N 3 k~ 2 points that lie in at least k lines of £. 
Such an estimate does not hold for an arbitrary set of N 2 lines. For example, if all the 
lines of £ lie in a plane, then one may expect ~ iV 4 points that lie in at least 2 lines. This 
number of intersection points is far too high. There is another important example, which 
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occurs when all the lines lie in a regulus. Recall that a regulus is a doubly-ruled surface, 
and each line from one ruling intersects all the lines from the other ruling. If £ contained 
N 2 /2 lines in each of the rulings, then we would have ~ N 4 points that lie in at least 2 
lines. Because of this example, we have to show that not too many lines of £ lie in a plane 
or a regulus. 

Proposition 2.8. No more than N lines of £ lie in a single plane. No more than 0(N) 
lines of £ lie in a single regulus. 

Proof. For each p £ P, we consider the subset £ p C £ given by 

£p = {Lpq}q£P. 

Notice that if q ^ q' , then L pq and L pq / cannot intersect. So the lines of £ p are disjoint. 
From equation I2.3| it follows that the lines of £ p all have different directions. So the lines 
of £ p are pairwise skew, and no two of them lie in the same plane. Therefore, any plane 
contains at most N lines of £. 

The situation for reguli is more complicated because all N lines of £ p may lie in a single 
regulus. But we will prove that this can only occur for at most two values of p. To formulate 
this argument, we define £' := {L pg } ggR 2, so that £ p C £'. 

Lemma 2.9. Suppose that a regulus R contains at least five lines of £' p . Then all the lines 
in one ruling of R lie in £' p . 

Given this lemma, the rest of the proof of Proposition 12.81 is straightforward. If a regulus 
R contains at least five lines of £ p , then all the lines in one ruling of R lie in £' p . But if 
Pi 7^ P2, then £' and £ p2 are disjoint, which we can check from the explicit formula in 
equation 12.31 Since a regulus has only two rulings, there are at most two values of p such 
that R contains > 5 lines of £ p . These two values of p contribute < 2N lines of £ in the 
surface R. The other N — 2 values of p contribute at most 4(iV — 2) lines of £ in the surface 
R. Therefore, the surface R contains at most 2N + 4(iV — 2) < N lines of £. 

Proof of Lemma \2.9l We fix the value of p. We'll check below that each point of R 3 lies in 
exactly one line of £f p . We will construct a non- vanishing vector field V = (V\, V2, V3) on 
R 3 tangent to the lines of £' . Moreover, the coefficients Vi, V2 and V3 are all polynomials 
in (x, y, z) of degree < 2. This construction is slightly tedious, but straightforward. We 
postpone it to the end of the proof. 

The regulus R is defined by an irreducible polynomial / of degree 2. Now suppose that 
a line L pq lies in R. At each point x E L pq , the vector V(x) points tangent to the line 
L pq , and so the directional derivative of / in the direction V{x) vanishes at the point x. 
In other words the dot product V ■ Vf vanishes on the line L pq . Since / has degree 2, the 
dot product V ■ V/ is a degree 2 polynomial. 
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Suppose that R contains five lines of 2l p . We know that / vanishes on each line, and the 
previous paragraph shows that V ■ V/ vanishes on each line. By Bezout's theorem (see 
Lemma l3.ip . / and V ■ V/ must have a common factor. Since / is irreducible, we must 
have that / divides V ■ V/. In other words, V ■ V/ vanishes on the surface R, and so V 
is tangent to R at every point of R. If x denotes any point in R, and we let L be the line 
of £p containing x, then we see that this line lies in R. In this way, we get a ruling of R 
consisting of lines from £' . 

It remains to define the vector field V. We begin by checking that each point (x, y, z) lies 
in exactly one line of £' . By equation 12.31 (x, y, z) lies in L pq if and only if the following 
equation holds for some t. 

+ * , Z+SL , 0) + t( *L^L, 1) _ (x, y, ,). 

Given p and (x, y, z), we can solve uniquely for t and (q x , q y ). First of all, we see that t = z. 
Next we get a matrix equation of the following form: 




= a{x,y,z). 



In this equation, a(x, y, z) is a vector whose entries are polynomials in x, y, z of degree 
< 1. (The polynomials also depend on p, but since p is fixed, we suppress the dependence.) 
Since the determinant of the matrix on the left-hand side is 1 + z 2 > 0, we can solve this 
equation for q x and q y . The solution has the form 

( J ) =(z 2 + ir 1 b(x,y,z). (2.4) 

In this equation, b(x, y, z) is a vector whose entries are polynomials in x, y, z of degree < 2. 

The vector field V(x, y, z) is (z 2 + l)( 9v 2 Pv , Px ~ qx 1 1). Recall that p is fixed, and q x and g y 
can be expressed in terms of (x, y, z) by the equation above. By equation 12.31 this vector 
field is tangent to the line L pq . After multiplying out, the third entry of V is z 2 + 1, so V 
is non- vanishing. Plugging in equation 12.41 for q x and q y and multiplying out, we see that 
the entries of V(x, y, z) are polynomials of degree < 2. □ 

This concludes the proof of Proposition 12.81 □ 

We have now connected the distinct distance problem to the incidence geometry problem 
we mentioned in the introduction. We know that £ consists of iV 2 lines with < N lines in 
any plane or regulus. We now state our two results on incidence geometry. 
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Theorem 2.10. Let £ be any set of N 2 lines in R 3 for which no more than N lie in 
a common plane and no more than 0(N) lie in a common regulus. Then the number of 
points of intersection of two lines in £ is 0(N 3 ). 

Theorem 2.11. Let £ be any set of N 2 lines in R 3 for which no more than N lie in a 
common plane, and let k be a number 3 < k < N. Let & k be the set of points where at 
least k lines meet. Then 

\6k\<N 3 k~ 2 . 

Elekes and Sharir essentially conjectured these two theorems (Conjecture 1 in [ES| ). (The 
difference is that they used different coordinates, so their conjectures are about helices.) 
In the case k = 3, Theorem 12.111 was proven in [EKS . 

Combining these theorems with the coordinates p and Proposition 12. 8| we get bounds for 
\G' k {P)\. Theorem EHH shows that \G' 2 (P)\ < N 3 . Theorem ETQ shows that \G' k (P)\ < 
N 3 k- 2 for 3 < k < N. 

We now prove similar bounds for \G k rans (P)\. These bounds are completely elemen- 
tary 

Lemma 2.12. Let P be any set of N points in R 2 . The number of quadruples in E~ 1 (G trans ) 
is < N 3 . Moreover, \G k rans {P)\ < N 3 k- 2 for all 2 < k < N . 

Proof. Suppose that {pi,P2,P3,P<i) is a distance quadruple in E~ 1 (G trans ). By definition, 
there is a translation g so that g{p\) = P3 and g{p2) = Pi- Therefore, ps — p\ = p^ — p2- 
This equation allows us to determine p^ from Pi,P2,P3- Hence there are < A^ 3 quadruples 
in E~ l (G trans ). 

By Proposition 12.41 we see that 



A /k\ 

\ E -i(G trans )\=j2 2 ( )\Gtr s (p)\- 

k=2 ^ ' 



Noting that \G\ rans {P)\ = £ />fc |G^f ns (P)|, we see that 



N s > \E- 1 {G trans )\ > 2(^yG k rans {P)\. 

This inequality shows that \G k rans {P)\ < N 3 k- 2 for all 2 < k < N. □ 

This substantially ends Section 2 of the paper. To conclude, we give a summary and make 
some comments. 
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The new ingredients in this paper are Theorems 12.101 and 12.111 which we prove in Sections 
3 and 4. These theorems allow us to bound the partial symmetries of P in G': they 
imply that \G' k (P)\ < N 3 k~ 2 for all 2 < k < N. An elementary argument in Lemma 12.121 
shows the same estimates for |Gjf ans (P)|. Combining these, we see that \Gk(P)\ < N s k~ 2 
for 2 < k < N, proving Proposition 12.51 Now the number of quadruples in Q(P) is 
expressed in terms of \Gk(P)\ in equation 12.21 Plugging in our bound for \Gk(P)\, we 
get that |Q(-P)| < iV 3 logiV, proving Proposition 12.21 Finally, the number of distinct 
distances is related to |Q(-P)| by Lemma [2.11 Plugging in our bound for \Q(P)\, we see 
that |d(-P)| > N(log N)^ 1 , proving our main theorem. 

The group G acts as a bridge connecting the original problem on distinct distances to 
the incidence geometry of lines in R 3 . The distance set d(P) is related to the set of 
quadruples Q(P) which is related to the partial symmetries Gk(P), which correspond to 
fc-fold intersections of the lines in £. The group G is a natural symmetry group for the 
problem of distinct distances, but this way of using the symmetry group is new and rather 
surprising. 

Our estimates show that sets with few distinct distances must have many partial sym- 
metries. For example, if Gs(P) is empty, then our results show that |Q(P)| < iV 3 and 
|<i(-P)| > N. Also, any set with |d(P)| < N(log N)~ l l 2 must have a partial symmetry with 
k > exp(clog 1 ^ 2 N), for a universal constant c > 0. Any set with |(i(P)| < iV^logiV) -1 
must have a partial symmetry with k > N c for a universal c > 0. 

3 Flecnodes 

Our goal in this section is to prove Theorem 12.101 We will do this by purely algebraic 
methods following essentially the proof strategy of [GKJ. That is, we will show that an 
important subset of our lines lies in the zero set of a fairly low degree polynomial p. What 
requires a new idea is the next step. We need a polynomial q derived from p with similar 
degree on which the lines also vanish. With that information we will apply a variant of 
Bezout's lemma. 

Lemma 3.1. Let p(x,y, z) and q(x,y, z) be polynomials on R 3 of degrees m and n respec- 
tively. If there is a set of mn + 1 distinct lines simultaneously contained in the zero set of 
p and the zero set of q then p and q have a common factor. 

Thus we will conclude that p and the derived polynomial q must have a common factor 
and we will arrive at some geometrical conclusion from this based on the way that q was 
derived. In the paper |GK| . the derived polynomials that we used were the gradient of p and 
the algebraic version of the second fundamental form of the surface given by p = 0. These 
were good choices because when three or more lines were incident at each point, we knew 
on geometric grounds that one or the other would vanish at each point, because the point 
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would be either critical or flat. However, here we are faced with points at which only two 
lines intersect, and so we must make a more clever choice of the derived polynomial. 

We begin with the definition of a flecnode. Given an algebraic surface in R 3 given by the 
equation p(x, y, z) = where p is a polynomial of degree d at least 3, a flecnode is a point 
(x, y, z) where a line agrees with the surface to order three. To find all such points, we 
might solve the system of equations: 

p(x,y,z)=0; V v p{x, y, z) = 0; V%p(x, y, z) = 0; V^p(x, y, z) = 0. 

These are four equations for six unknowns, (x, y, z) and the components for the direction 
v. However the last three equations are homogeneous in v and may be viewed as three 
equations in five unknowns (and the whole system as 4 equations in 5 unknowns.) We may 
reduce the last three equations to a single equation in three unknowns (x,y,z). We write 
the reduced equation as 

Fl{p){x,y,z) = 0. 

The polynomial Fl(p) is of degree lid — 24. It is called the flecnode polynomial of p and 
vanishes at any flecnode of any level set of p. (See [Salmj Art. 588 pages 277-78.) 

The term flecnode was apparently first coined by Cayley. The polynomial Fl(p) was discov- 
ered by the Rev. George Salmon, but its most important property to us was communicated 
to him by Cayley. 

Proposition 3.2. The surface p = is ruled if and only ifF\(p) is everywhere vanishing 
on it. 

An algebraic surface (in R 3 ) is ruled if it contains a line passing through every point. The 
set of all lines contained in an algebraic surface (of some degree N) is an algebraic set of 
lines. (This is because a line is contained in the surface if and only if it is contained to 
order N + 1 at one of its points. So a line is contained in the surface if and only if N + 1 
polynomial equations in the parameters of the line are satisfied.) The set of lines contained 
in a surface may have two dimensional components, one-dimensional components and zero- 
dimensional components. It is easy to see that an algebraic surface in R 3 contains a two 
dimensional set of lines only if it has a plane as a factor. (The way to see this is to find 
a regular point of the surface with an infinite number of lines going through it. Then the 
surface must contain the tangent plane to this point.) Thus an algebraic surface which 
is ruled and plane-free will contain both a 1-dimensional set of lines (the generators) and 
possibly a 0-dimensional set of lines. A detailed classical treatment of ruled surfaces is 
given in [Salmj Chapter XIII Part 3. 

One important example of a ruled surface is a regulus. A regulus is actually doubly-ruled: 
every point in the regulus lies in two lines in the regulus. A ruled surface is called singly- 
ruled if a generic point in the surface lies in only one line in the surface. (Some points in 
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a singly-ruled surface may lie in two lines.) Except for reguli and planes, every irreducible 
ruled surface (in R 3 ) is singly-ruled. (See the explanation in Section 1). 

One direction of Proposition 13.21 is obvious. If the surface is ruled, there is a line contained 
in the surface at every point. If the line is contained in the surface, it certainly agrees 
to order 3. The reverse direction is more computational. It is indicated in a footnote to 
(Salmi Art - 588 P a g e 278 - 0ne 

sees that setting Fl(p) = is a way of rewriting a differential 
equation on p which implies ruledness. Proposition 13.21 was used in a famous paper of Segre 
|Seg . For a generalization to manifolds in higher dimensions see [Land]. 



An immediate corollary of the proposition is 

Corollary 3.3. Let p = be a degree d hypersurface in H d . Suppose that the surface 
contains more than lid 2 — 24d lines. Then p has a ruled factor. 

Proof. By lemma \3A\ since both p and Fl(p) vanish on the same set of more than lld 2 -24d 
lines, they must have a common factor q. Since q is a factor of p and Fl(p) vanishes on the 
surface q = 0, it must be that at every regular point of the surface q = 0, there is a line 
which meets the surface to order 3. Thus Fl(g) = which implies by Proposition 13.21 that 
q is ruled. □ 

Now we would like to consider ruled surfaces of degree less than N. Thus our surfaces are 
the sets 

p{x,y,z) = 

for a polynomial p (which we may choose square free) of degree less than N. We may 
uniquely factorize the polynomial into irreducibles: 

P = P1P2 ■■■Pm- 

We say that p is plane-free and regulus-free if none of the zero sets of the factors is a plane 
or a regulus. Thus if p is plane-free and regulus-free, the zero-set of each of the factors is 
an irreducible algebraic singly-ruled surface. We now state the main geometrical lemma 
for proving Theorem 12. 101 

Lemma 3.4. Let p be a polynomial of degree less than N so that p = is ruled and so 
that p is plane-free and regulus-free. Let £i be a set of lines contained in the surface p = 
with < ./V 2 . Let Q\ be the set of points of intersection of lines in Then 

\Qi\<N\ 



Before we begin in earnest the proof of Lemma 13.41 we will nail down a few delicate points 
of the geometry of irreducible singly-ruled surfaces. 

We let p(x, y, z) be an irreducible polynomial so that p(x, y, z) = is a ruled surface which 
is not a plane or a regulus. In other words, the surface S = {(x,y,z) : p(x,y,z) = 0} is 
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irreducible and singly-ruled. We say that a point (xq,i/q,zq) G S is an exceptional point 
of the surface, if it lies on infinitely many lines contained in the surface. We say that 
a line I contained in S is an exceptional line of the surface if there are infinitely many 
lines in S which intersect I at non-exceptional points. We prove a structural lemma about 
exceptional points and exceptional lines of irreducible singly-ruled surfaces. 
Lemma 3.5. Let p(x, y, z) be an irreducible polynomial. Let S = {(x, y, z) : p(x, y, z) = 0} 
be an irreducible surface which is neither a plane nor a regulus. 

1. Let (xo,yo,ZQ) be an exceptional point of S. Then every other point (x,y,z) of S is 
on a line I which is contained in S and which contains the point (xo, yo, zq). 

2. Let I be an exceptional line of S. Then there is an algebraic curve C so that every 
point of S not lying on C is contained in a line contained in S and intersecting I. 

We proceed to give an elementary proof of Lemma 13.51 

Proof. To prove the first part, we observe that by a change of coordinates we can move 
(xo, yo, zo) to the origin. We let Q be the set of points q different from the origin so that 
the line from q to the origin is contained in S. We observe that Q is the intersection of an 
algebraic set with the complement of the origin. That is, there is a finite set of polynomials 
E so that a point q different from the origin lies in Q if and only if each polynomial in E 
vanishes at q. This is because if d is the degree of p, to test whether q £ Q, we need only 
check that the line containing q and the origin is tangent to S to degree d + 1 at q. Now 
by assumption, the zero set of each polynomial in E contains the union of infinitely many 
lines contained in S. Thus by Lemma 13.11 and by the irreducibility of p, it must be that 
each polynomial in E has p as a factor. Therefore Q is all of S except the origin. We have 
proved the first part. 

Now to prove the second part, we observe that by a change of coordinates, we may choose 
I to be the coordinate line y = 0; z = 0. We let Q be the set of points q not on / so that 
there is a line from q to a non-exceptional point of I which is contained in S. We would like 
to claim that Q is the intersection of an algebraic set with the complement of an algebraic 
curve. If we are able to show this, we will prove the second claim in the same way that we 
proved the first. To do this, for points (x,y,z) on S outside of an algebraic curve, we will 
identify the point at which the line containing (x,y,z) intersects /. 

Consider a point (x, y, z) on S for which y, z) ^ 0. In particular, the point (x, y, z) is 
a regular point of S. Since ^(x,y,z) ^ 0, there is a unique point (x',0, 0) of / which lies 
in the tangent plane to S at the point (x,y,z). In fact, we can solve for x' as a rational 
function of (x, y, z) with only the polynomial ^ in the denominator. Thus we can find 
a set E of rational functions having only powers of ^ in their denominators, so that for 
any (x,y,z) at which ^ does not vanish, we have that (x,y,z) S Q if and only if, every 
function in E vanishes on (x,y,z). 
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In order for the previous paragraph to be useful to us, we need to know that does not 
vanish identically on S. Suppose that it did. Since || is of lower degree than p and p 
is irreducible, it must be that ^ vanishes identically as a polynomial so that p depends 
only on y and z. In this case, since S contains I and it contains a line V intersecting I, 
it must contain all translates of I' in the x-direction. Thus it contains a plane which is a 
contradiction. 

Thus, we let C be the algebraic curve where both p and ^2 vanish. Away from C, there 
is a finite set of polynomials F (which we obtain from E by multiplying by a large enough 
power of ||) so that a point (x, y, z) of S outside of C is in Q if and only if each polynomial 
in F vanishes at (x,y,z). Since we know that p is irreducible and Q contains an infinite 
number of lines, it must be that each polynomial in F has p as a factor. Thus every point 
of S which is outside of C lies in Q which was to be shown. 

□ 

Now that we have established our structural result, Lemma 13.5} we may use it to obtain a 
corollary giving quantitative bounds on the number of exceptional points and lines. 
Corollary 3.6. Let p(x,y, z) be an irreducible polynomial. Let S = {(x,y,z) :p(x,y,z) = 
0} be an irreducible surface which is neither a plane nor a regulus. Then S has at most 
one exceptional point and at most two exceptional lines. 

We now prove Corollary 13.61 

Proof. Let (xo,yo, Zq) and (xi, yi, Z\) be distinct exceptional points of S. Since S is singly- 
ruled, the generic point of S is contained in only a single line I contained in S. Thus 
by Lemma [3.51 if the point is different from (xo,yo>-zo) and (xi,yi, z\), this line I must 
contain both (xo,yo,zo) and (xi,y±,Zi) . But there is only one such line, and that is a 
contradiction. 

Now let li,h,h De exceptional lines of 5. There are curves C\,C2, and C3 so that the 
generic point in the complement of C\, C2, and C3 lies on only one line contained in S and 
this line must intersect each of l\, I2, and £3. Thus there are infinitely many lines contained 
in S which intersect each of and Z3. (Moreover, since the lines are exceptional, there 
must be an infinite set of lines which intersect the three away from the possible three points 
of intersection of any two of h,l2, and Z3.) If the any two of the three lines are coplanar, 
this means there is an infinite set of lines contained in S which lie in one plane. This 
contradicts the irreducibility and nonplanarity of 5. If contrariwise, the three lines Zi, Z2, 
and Z3 are pairwise skew, then the set of all lines which intersect all three are one ruling of 
a regulus. In this case, S contains infinitely many lines of a regulus which contradicts the 
fact that S is irreducible and not a regulus. 

□ 
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For context, we remark that an irreducible singly-ruled surface with an exceptional point 
is often referred to as a cone and the exceptional point is referred to as the cone point. 
Irreducible ruled surfaces with two exceptional lines do exist: one way of constructing a 
ruled surface with two exceptional lines is to choose a curve in the two-dimensional set of 
lines which intersect a pair of skew lines. 

At last, we may begin the proof of Lemma 13.41 

Proof. We say that a point (x, y, z) is exceptional for the surface p = 0, if it is exceptional 
for pj = where pj is one of the irreducible factors of p. We say that a line I is exceptional 
for the surface p = if it is exceptional for pj = where pj is one of the irreducible factors 
of p. Thus, in light of Corollary 13. 61 there are no more than iV exceptional points and 2N 
exceptional lines for p = 0. Thus there are < N 3 intersections between exceptional lines 
and lines of £i. Thus to prove the lemma, we need only consider intersections between 
nonexceptional lines of £i at nonexceptional points. 

We note that any line contained in a ruled surface which is not a generator must be an 
exceptional line since each point of the line will have a generator going through it. (The 
definition of a ruled surface is that every point lies in a line in the surface. Since there 
are only finitely many non- generators, almost every point must lie in a generator. But in 
fact every point lies in a generator by a limiting argument. Let q be a point in the ruled 
surface and let qi be a sequence of points that converge to q with qi lying in a generator 
li. By taking a subsequence, we can arrange that the directions of the li converge, and so 
the lines li converge to a limit line I which contains q and lies in the surface. This line is a 
limit of generators, and so it is a generator.) 

Let I be a non-exceptional line in the ruled surface. In particular I is a generator. We claim 
that there are at most N — \ non-exceptional points in I where I intersects another non- 
exceptional line in the ruled surface. This claim implies that there are at most (N — 1)N 2 
non-exceptional points where two non-exceptional lines intersect, proving the bound we 
want. 

To prove the claim, we repeat an argument found in [Salm| Art 485 pages 88-89. Choose 
a plane ir through the generator I. The plane intersects the surface in a curve of degree 
N. One component is the generator itself. The other component is an algebraic curve c of 
degree N — 1. There are at most N — 1 points of intersection between I and c. Suppose 
that I' is another non-exceptional line and that I' intersects I at a non-exceptional point q. 
It suffices to prove that q lies in the curve c. Since V is a generator, it lies in a continuous 
1-parameter family of other generators. Consider a small open set of generators around V . 
These generators intersect the plane n. So each of them intersects either I or c. Since q is 
non-exceptional, only finitely many of them intersect q. Since there are only finitely many 
exceptional points, we can arrange that each generator in our small open set intersects it 
in a non-exceptional point. Since I is non-exceptional, only finitely many of our generators 
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can intersect I. Therefore, almost all of our generators must intersect c. This is only 
possible if q lies in c. □ 



Now we are ready to begin the proof of Theorem 12.101 We assume we have a set £ of at 
most N 2 lines for which no more than N lie in a plane and no more than N lie in a regulus. 
We suppose, by way of contradiction, that for Q, a positive real number sufficiently large, 
there are QN 3 points of intersection of lines of £ and we assume that this is an optimal 
example, so that for no M < N do we have a set of M 2 lines so that no more than M lie in 
a plane and no more than M lie in a regulus is it the case that there are more than QM 3 
intersections. (N need not be an integer.) 

We now apply a degree reduction argument similar to the one in |GK| . We let £' be the 
subset of £ consisting of lines which intersect other lines of £ in at least different 

points. The lines not in £' participate in at most points of intersection. Thus there 

are at least 9< ^ points of intersection between lines of £'. We define a number a with 
< a < 1 so that £' has aN 2 lines. 

Now we select a random subset £" of the lines of £' choosing lines independently with 
probability ^22.. With positive probability, there will be no more than 200 q N lines in £" 

and each line of £' will intersect lines of £" in at least N different points. Now pick 
points on each line of £". (R is a constant which is sufficiently large but universal.) Call the 

3 3 

set of all of the points 6. There are 0(^^i — ) points in (3, so we may find a polynomial p 

i i 

of deg ree 0( — ^ — ) which vanishes on every point of &. With R sufficiently large, p must 

vanish identically on every line of £". Since each line of £' meets £" at N different points, 
it must be that p vanishes identically on each line of £'. Thus ends the degree reduction 
argument and we will now study the relatively low degree polynomial p. 

We may factor p = p\P2 where p\ is the product of the ruled irreducible factors of p and p2 

i 

is the product of unruled irreducible factors of p. Each of p\ and p2 is of degree 0( a2 ^ ). 

Q 7 

(We have suppressed the R dependence since R is universal.) We break up the set of lines 
of £' into the disjoint subsets £i consisting of those lines in the zero set of p\ and £2 
consisting of all the other lines in £'. 

There are no more than 0(N 3 ) points of intersection between lines of £1 and £2 since each 

1 

line of £2 contains no more than 0( a ^^ ) points where p\ is zero. Thus we are left with two 

(not mutually-exclusive) cases which cover all possibilities. There are either ^q— points 
of intersection between lines of £1 or there are 3< ?^ points of intersection between lines 
of £2. We will handle these separately. 
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Suppose there are " intersections between lines of £1. We factor p\ = P3P4, where P3 
is plane-free and regulus-free and p^ is a product of planes and reguli. We break £1 into 
disjoint sets £3 and £4, with £3 consisting of lines in the zero set of p^ and £4 consisting of 
all other lines of £1. As before there 0(N 3 ) points of intersection between lines of £3 and 
£4 since lines of £4 are not in the zero set of p%. Moreover there are at most 0(iV 3 ) points 
of intersection between lines of £4 because they lie in at most N planes and reguli each 
containing at most N lines. (We just see that each line has at most 0(N) intersections with 
planes and reguli it is not contained in and there are at most 0(N 2 ) points of intersection 
between lines internal to each plane and regulus.) However there cannot be more than 
0(N 3 ) points of intersection between lines of £3 by applying the key lemma \3M (Here we 
used that p$ is plane-free and regulus-free.) 

Thus we must be in the second case, where many of the points of intersection are between 
lines of £2, all of which lie in the zero set of P2 which is totally unruled. Recall that P2 is of 

1 r2 

degree 0( a2 i )• Thus by Corollary 13.31 its zero-set contains no more than 0(^q~) lines. 

We would like to now invoke the fact that the example we started with was optimal and 
reach a contradiction. But we can't quite do that. Our set £2 has f3N 2 lines with (3 = O(^) 
and we only know that there are no more than iV lines in any plane or regulus, whereas 
we need to know that there are no more than y/fiN lines. If this is the case we are done. 
If not we construct a subset £5 as follows. If there is a plane or regulus containing more 
than \/]3N lines of £2, we put those lines in £5 and remove them from £2. We repeat as 
needed labelling the remaining lines £6- Since we removed O(N) planes and reguli, there 
are 0(N 3 ) points of intersection between lines of £5. Since no lines of £6 belong to any 
plane or regulus of £5 there are fewer than 0(iV 3 ) points of intersection between lines of 
£5 and £6- Now we apply optimality of our original example to rule out more than 0(-^-) 

points of intersection between lines of £6- Thus we have reached a contradiction. 



4 Cell decompositions 

In this section, we construct a new type of cell decomposition of R n , where the walls of 
the cells are the zero set of a polynomial. We use this type of cell decomposition to prove 
an incidence theorem for lines in R 3 when not too many lines lie in a plane. The cell 
decomposition is described in the following theorem. 

Theorem 4.1. If & is a set of S points in R n and J > 1 is an integer, then there is 
a polynomial surface Z of degree d < 2 J / n with the following property. The complement 
R n \ Z is the union of 2 J open cells Oi, and each cell contains < 2~ J S points of &. 

Remark: Some or all of the points of & may lie inside the surface Z. Recall that Z is not 
part of any of the open sets Oi . So there are two extreme cases in Theorem 14.11 In one 
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extreme, all the points of & lie in the open cells Oi, and there are exactly 2~ S points in 
each cell. In the other extreme, all the points of (3 lie in the surface Z. When the points 
all lie in Z, the theorem does not give any information about where in Z they lie. 

The proof of Theorem 14. II is based on the polynomial ham sandwich theorem of Stone and 
Tukey [ST] . For context, we first recall the original ham sandwich theorem. 
Theorem 4.2. (Ham sandwich theorem) If U\, U n C R n are finite volume open sets, 
then there is a hyperplane which bisects each set Ui. 

The ham sandwich theorem was proven in the case n = 3 by Banach in the late 30's, 
using the Borsuk-Ulam theorem. In 1942, Stone and Tukey generalized Banach's proof 
to all dimensions. They also observed that the same argument applies to many other 
situations. In particular, they proved the following polynomial version of the ham sandwich 
theorem. 

We say that an algebraic hypersurface p(x\, . . . , x n ) = bisects a finite volume open set U 
if 

Voi(u n{ P < o}) = Voi(u n{ P > o}) = (i/2)Voi(u). 

Theorem 4.3. (Stone- Tukey, L r STl) For any degree d > 1, the following holds. Let 
Ui, . . . ,Um be any finite volume open sets in R n , with M = ( n ^ d ) — 1. Then there is 
a real algebraic hypersurface of degree at most d that bisects each Ui. 

(For a recent exposition of the proof, see [G].) 

We now adapt Theorem 14.31 to finite sets of points. Instead of open sets Ui, we will have 
finite sets Si. We say that a polynomial p bisects a finite set S if at most half the points in 
S are in {p > 0} and at most half the points in S are in {p < 0}. Note that p may vanish 
on some or all of the points of S. 

Corollary 4.4. Let Si,... ,Sm be finite sets of points in R n with M = ( n ^ d ) — 1. Then 
there is a real algebraic hypersurface of degree at most d that bisects each Si. 

Proof. For each 5 > 0, define Ui t s to be the union of 5-balls centered at the points of Si. By 
the polynomial ham sandwich theorem, Theorem 14. 3| we can find a non-zero polynomial 
PS of degree < d that bisects each set U^s. 

We want to take a limit of the polynomials ps as 5 — > 0. To help make this work, we pick 
a norm |||| on the space of polynomials of degree < d. Any norm will do - to be definite, 
let ||p|| denote the maximal absolute value of the coefficients of p. By scaling pg, we can 
assume that ||p^|| = 1 for all 5. Now we can find a sequence 5 m — > so that ps m converges 
in the space of degree < d polynomials. We let p be the limit polynomial and observe that 
||p|| = 1. In particular, p is not 0. Since the coefficients of ps m converge to the coefficients 
of p, it's easy to check that p$ converges to p uniformly on compact sets. 



20 



We claim that p bisects each set Si. We prove the claim by contradiction. Suppose instead 
that p > on more than half of the points of Si. (The case p < is similar.) Let S^~ C Si 
denote the set of points of Si where p > 0. By choosing e sufficiently small, we can assume 
that p > e on the e-ball around each point of Sf. Also, we can choose e small enough 
that the e-balls around the points of Si are disjoint. Since ps m converges to p uniformly 
on compact sets, we can find m large enough that p$ m > on the e-ball around each point 
of Sf. By making m large, we can also arrange that S m < e. Therefore, ps m > on the 
5 m -ball around each point of S^~ . But then ps m > on more than half of Ui t g m . This 
contradiction proves that p bisects Si. □ 

Using this finite polynomial ham sandwich theorem, we can quickly prove Theorem l4.1i 

Proof of Theorem \4-l\ We do the construction in J steps. In the first step, we pick a 
linear polynomial p\ that bisects &. We let & + and &~ be the sets where p\ is positive 
and negative, respectively. In the second step, we find a polynomial p2 that bisects & + 
and <3~. And so on. At each new step, we use Corollary 14.41 to bisect the sets from the 
previous step. 

We now describe the inductive procedure a little more precisely. At the end of step j, we 
have defined j polynomials pi,...,pj. We define 2 J subsets of & by looking at the points 
where the polynomials p\, ...,Pj have specified signs. Then we use Corollary 14.41 to bisect 
each of these 2 3 sets. It follows by induction that each subset contains < 2~ J S points. 

Finally, we let p be the product p±...pj, and we let Z denote the zero set of p. 

First we estimate the degree of p. By Corollary 14. 4| the degree of pj is < 2 J / n . Hence the 
degree of p is d < £/=i 2*> < 2 J / n . 

Now we define the 2 J open sets Oi as the sets where the polynomials p%, ...,pj have specified 
signs. For example, one of the sets Oi is defined by the inequalities p\(x) > 0,p2(x) < 
0,P3(x) > 0, ...,pj(x) > 0. The sets Oi are open and disjoint. Their union is exactly the 
complement of Z. As we saw above, the number of points in & n Oi is at most 2~ J S. □ 

Using this type of cell decomposition, we will prove an estimate for incidences of lines when 
not too many lines lie in a plane. 

Theorem 4.5. Let k > 3. Let SI be a set of L lines in R 3 with at most B lines in any 
plane. Let 6 be the set of points in R 3 intersecting at least k lines of £. Then the following 
inequality holds: 

\e\ < C[L 3/2 k- 2 + LBkT z + LAT 1 ]. 
Theorem 14.51 implies Theorem 12 . 1 1 1 by setting L = iV 2 and B = N. 
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This theorem is sharp up to constant factors in a number of cases. These examples help 
to give a sense of the right-hand side. 

Example 1. Choose L/k points. Let £ consist of k lines through each point. The set £ 
has a k-fold incidence at each of the L/k points. (We can also arrange that no three lines 
lie in a plane.) 

Example 2. Choose L/B planes. Put B lines in each of the planes. The B lines in each 
plane can be arranged to create B 2 k~ s k-fold incidences. (See the examples in |SzTj .) This 
set of lines has a total of LBk~ s k-fold incidences. 

Example 3. Let Go denote the integer lattice {(a,b,0)} with 1 < a,b < L 1 / 4 . Let G\ 
denote the integer lattice {(a, b, 1)} with 1 < a,b < L 1//4 . Let £ denote all the lines from 
a point of Go to a point of G\. The horizontal planes z = and z = 1 do not contain any 
lines of £. Any other plane contains at most L 1 / 4 points of each Gj, and so at most L 1 / 2 
lines of £. We will prove in the appendix that there are ~ L 3 / 2 k~ 2 points that lie in > k 
lines of £ for each k in the range 2 < k < L 1 / 2 /400. 

For context, we should compare Theorem 14.51 to the Szemeredi- Trotter theorem, which 
holds in all dimensions as we now recall. 

Theorem 4.6. If £ is a set of L lines in R n ; and 6 denotes the set of points lying in at 
least k lines of £, then 

\&\ <L 2 k- 3 + Lk-\ 

The higher-dimensional case follows easily from the two-dimensional case by taking a 
generic projection from R n to R 2 . The set of lines £ will project to L distinct lines 
in R 2 , and the points of S project to distinct points in R 2 . 

Theorem 14.51 is a refinement of Theorem 14.61 When B = L, Theorem 14.51 is Theorem 14.61 
Theorem 14.51 tells us how much we can improve the Szemeredi- Trotter theorem if we know 
in addition that not too many lines lie in a plane. 

We will use Theorem 14.61 in our proof. Recently, in [KMS], Kaplan, Matousek, and 
Sharir gave a new proof of the Szemeredi- Trotter theorem using polynomial cell decompo- 
sitions. 

Now we turn to the proof of Theorem 14.51 An important special case is the uniform case 
where each point has ~ k lines through it and each line contains about the same number 
of points. We will first prove the theorem under some uniformity hypotheses. 
Proposition 4.7. Let k > 3. Let £ be a set of L lines in R 3 with at most B lines in any 
plane. Let & be a set of S points in R 3 so that each point intersects between k and 2k 
lines of £. 
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Also, we assume that there are > jqqL lines in £ which each contain > j^SkL points 
of 6. 

Then S < C[L 3 / 2 k~ 2 + LBk~ 3 + Lk' 1 }. 

The second paragraph of Proposition 14. 71 is a uniformity assumption about the lines. Note 
that there are ~ Sk total incidences between lines of £ and points of 6. Therefore, an 
average line of £ contains ~ SkL^ 1 points of 6. We assume here that there are many lines 
that are about average. Proposition 14.71 is the main part of the proof of Theorem 14.51 The 
general case reduces to this special case by easy inductive arguments. 

Proof. We begin by outlining our strategy. We suppose that 

S > AL 3 / 2 k- 2 + Lk- 1 . (4.1) 

In this equation, A represents a large constant that we will choose below. Assuming 14.11 
we need to show that many lines of £ lie in a plane. In particular, we will find a plane that 
contains > SL~ l k 3 lines of £. This means that B > SL~ l k?, and hence S < BLk~ 3 , and 
we will be done. 

Let us outline how we find the plane. First we prove that a definite fraction of the lines of 
£ lie in an algebraic surface Z of degree < L 2 S~ l k~ 3 . Second we prove that this variety Z 
contains some planes, and that a definite fraction of the lines of £ lie in the planes. Since 
there are at most d planes, one plane must contain > L/d lines. Because d < L 2 S^ 1 ^ 3 , 
this plane contains > SL _1 A; 3 lines, which is what we wanted to prove. 

Our bound for the degree d is sharp up to a constant factor because of Example 2 above. 
In this example, the lines £ lie in ~ L 2 S~ l k~ 3 planes. Since the planes can be taken in 
general position, the lines £ do not lie in an algebraic surface of lower degree. 

(Our bound for the degree d is the new ingredient in this section. We will find the algebraic 
surface Z by using the polynomial cell decomposition of Theorem 14.11 We initially tried to 
find Z by using the purely algebraic degree reduction argument from |GK] , as in Section 
3. With this method, we proved that a definite fraction of the lines of Z lie in an algebraic 
surface of degree L 2 S~ l k~ 2 . But this degree is too large to make our argument work.) 

Now we begin the detailed proof of Proposition 14.71 

First we prove that almost all points of & lie in a surface Z with controlled degree. This 
lemma is the most important step in the proof of Theorem 14.51 

Lemma 4.8. // the constant A in inequality \4-l\ is sufficiently large, then there is an 
algebraic surface Z of degree < L 2 S~ l k~ 3 that contains at least (1 — lO" 8 )^ points of 6. 
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Proof. We let 9 denote a large constant which we will choose later, and we let d be the 
greatest integer less than 9L 2 S~ 1 k~ 3 . This d will be the degree of our surface Z. First 
we check that d > 1. By the Szemeredi- Trotter theorem, S < L 2 k~ 3 + LA; -1 . But by 
inequality 14,11 S > Lk^ 1 . Therefore, S < L 2 k~ 3 . Hence we can choose 6 so that d > 1. 

Now we apply Theorem 14.11 to construct a degree d surface Z such that R 3 \ Z is a union 
of ~ d 3 open cells Oi, each containing < Sd~ 3 points of &. 

Let us suppose that Z contains < (1 — 10 _8 )5 points of 6. So the open cells Oi all together 
contain > 10 -8 ,S points of &. Since each cell contains < Sd~ 3 points of 6, there must be 
> d 3 cells that each contain > Sd~ 3 points of (3. We call these full cells. 

We now prove an upper bound for S using the cellular method from [CEG SWj. 

We let £(Oi) denote the subset of lines of £ which intersect Oj. We let L ce u be the minimum 
of |£(Oj)| among all the full cells Oi. We apply the Szemeredi- Trotter inequality to the 
full cell with the fewest lines. Since this full cell still contains > Sd~' i points, we get the 
following inequality. 

Sd- 3 <L 2 cell k-* + L ceU k-\ 

Next we estimate L ce u in terms of the degree of Z. A line either lies in Z or else it intersects 
Z at most d times. Every time a line moves from one open cell Oi to another, it needs to 
pass through Z. So each line of £ intersects at most d + 1 cells Oi. So there are < L(d+ 1) 
pairs (l,Oi) where / G £(0?.)- But there are ~ <i 3 full cells Oi. Hence L ce n < LdT 2 . 
Plugging in this estimate for L ce u we get the following inequality. 

Sd' 3 < L 2 d- A k~ 3 + Ld- 2 k- 1 . 
Recalling that d ~ 6L 2 S- l k- 3 and rearranging, we get the following inequality. 

S < cie^s + 9L 3 S~ 1 k- 4 ). 

Note that the constant C does not depend on 9. (We could work it out explicitly using an 
explicit constant in Theorem 14. II and in the Szemeredi- Trotter theorem.) At this point, we 
choose 9 sufficiently large so that C9~ x < 1/2. We can then move the term C9~ 1 S to the 
left-hand side, and rearrange to get the inequality 

S < 9 l / 2 L 3 ' 2 k- 2 . 
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If the constant A is sufficiently large, this inequality contradicts 14.11 We conclude that 
there are less than 10 _8 S points of 6 outside of Z. 

Finally, the degree of Z is d < etfS^k- 3 . The constant 6 is a particular number that we 
chose above. In particular 9 does not depend on A. And so d < L 2 S -1 /c -3 as desired. □ 

We let &z denote the points of & that lie in Z. By Lemma 14.81 I© \ ©z| < 10~ 8 S\ Our 
next goal is to prove that many lines of £ lie in the surface Z. This result depends on 
a quick calculation about the degree d. Recall that an average line of £ contains SkL^ 1 
points of &. We prove that the degree d is much smaller than SkL^ 1 . 

Lemma 4.9. If the constant A is sufficiently large, then 

d < 1Q-* SkL- 1 . 
Proof Inequality 14.11 can be rewritten as 

1 < A^SL-^k 2 . 

Squaring this, we see that 

d < dA- 2 S 2 L~ 3 k 4 < A~ 2 SkL~ l . 
Now choosing A sufficiently large finishes the proof. □ 
As an immediate corollary, we get the following lemma. 

Lemma 4.10. If I is a line of £ that contains at least 10 -8 SkL~ l points of&z, then I is 
contained in Z . 

Proof. The line / contains at least 1CP 8 SkL^ 1 points of Z. Since d > 1CP 8 SkL^ 1 , the line 
I must lie in the surface Z. □ 

Let £z denote the set of lines in £ that are contained in Z. 
Lemma 4.11. The set £z contains at least (1/200)1/ lines. 

Proof. We assumed that there are > (1/100)L lines of £ which each contain > (l/100)SkL~ ] 
points of (3. Let £o C £ be the set of these lines. We claim that most of these lines lie in 
SjZ- Suppose that a line I lies in £o \ £z- It must contain at least (l/W0)SkL~ 1 points 
of S. But by Lemma 14.101 it contains < lO^SfcL -1 points of &z- Therefore, it must 
contain at least (1/200)5A;L _1 points of & \ &z- This gives us the following inequality. 
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(1/200) SkL' 1 ]^ \ £ z \ < 1(6 \ 6 Z , £ \ 

Here we write / to abbreviate the number of incidences between a set of points and a set 
of lines. 

On the other hand, each point of (3 lies in at most 2k lines of £, giving us an upper bound 
on incidences: 

/(6\6 z ,£ \£z) < 2k\e\e z \ < 2-W- 8 Sk. 

Comparing these two inequalities, we see that |£o \ £z| < 4 • 10~ 6 L, which implies that 
|£z| > (1/200)L. □ 

We have now carried out the first step of our outline: we found a surface Z of degree 
< L 2 S ,_1 fc -3 which contains a definite fraction of the lines from £. 

We now turn to the second step of our outline. We will prove that Z contains some planes, 
and that these planes contain many lines of £. This step is closely based on the techniques 
in |GK| and |EKS] . The paper |EKS] contains a clear introduction to the techniques. 
In particular, Section 2 of [EKSj proves all of the fundamental lemmas from algebraic 
geometry that we need. 

Each point of G z lies in at least k lines of £. But such a point does not necessarily lie in 
any lines of £z- Therefore we make the following definition. 

We define & z to be the set of points in 6^ that lie in at least three lines of £ z . 

This subset is important because each point of & z is a special point of the surface Z: 
either a critical point or a flat point. Let's recall the definitions of critical points and flat 
points. 

The surface Z is the vanishing set of a polynomial p. The polynomial p can be factored into 
irreducible polynomials p = p\p%.... We assume that each irreducible factor of p appears 
only once. Now a point x G Z is called critical if the gradient Vp vanishes at x. If x G Z is 
not critical, we say that x is regular. In a small neighborhood of a regular point x G Z, Z is 
a smooth submanifold. We say that a regular point x G Z is flat if the second fundamental 
form of Z vanishes at x. 

Lemma 4.12. Each point of & z is either a critical point or a flat point of Z . 

Proof. Let x G & z . By definition, x lies in three lines which all lie in Z. If x is a critical 
point of Z, we are done. If x is a regular point of Z, then all three lines must lie in the 
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tangent space of Z at x. In particular, the three lines are coplanar. Let Vi, 1)2,1)3 be non- 
zero tangent vectors of the three lines at x. The second fundamental form of Z vanishes in 
each of these three directions. Since the second fundamental form is a symmetric bilinear 
form on the 2-dimensional tangent space, it must vanish. Therefore, x is a flat point of 
Z. □ 

(See also [EKS], Proposition 4 and Proposition 6 for a more detailed proof.) 

Lemma 14.121 shows that the points of & z are important. Next we show that almost every 
point of (3 lies in & z . 

Lemma 4.13. The set 6 \ & z contains at most 10 S points. 
Proof. Lemma EES] tells us that |6 \ & z \ < 1CT 8 S. 

Suppose x is a point in Gz \ &%• The point x lies in at least k lines from £, but it lies in 
at most two lines from £z- So x lies in > k — 2 lines of £ \ £z. 

(* - 2)\& z \&' z \< I{&z \ e' z , £ \ £z). 

On the other hand, we showed in Lemma f4. lOl that each line of £\£z contains < 10 -8 SkL^ 1 
points of &z- Therefore 

I(& z \ e'z, £ \ £z) < I(&z, £ \ £z) < (W^SkL-^L. 
Combining these inequalities, and recalling that k > 3, we see that 

\&z \ &'z\ < 10~ s — — S < 3 • 1(T 8 S. 
k — 2 

□ 

We let & cr it C & z denote the critical points in & z and we let 6 fi a t C & z denote the flat 
points of & z . We call a line I C Z a critical line of Z if every point of I is a critical point 
of Z. We call a line I C Z a flat line if it is not a critical line and every regular point in I 
is flat. 

Our next goal is to show that Z contains many flat lines, which is a step to showing that 
Z contains a plane. In order to do this, we show that the flat points of Z are defined by 
the vanishing of certain polynomials. 
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Lemma 4.14. Let x be a regular point of Z. Then x is flat if and only if the following 
three polynomial vectors vanish at x: 

V ejX VpVpxVp,j = 1,2,3. 

Here, ej are the coordinate vectors of R 3 , and x denotes the cross product of vectors. 
Each vector above has three components, so we have a total of nine polynomials. Each 
polynomial has degree < 3d. For more explanation, see Section 3 of |GK] or Section 2 of 
[EKS]. In [EKSj . they use a more efficient set of polynomials: only three polynomials. 

To find critical or flat lines, we use the following simple lemmas. 

Lemma 4.15. Suppose that a line I contains more than d critical points of Z. Then I is 
a critical line of Z. 

Proof. At each critical point of Z, the polynomial p and all the components of Vp vanish. 
Since p has degree d, we conclude that p vanishes on every point of I. Since Vp has degree 
d—1, we conclude that Vp vanishes on every point of I. Hence I is a critical line of Z. □ 

Lemma 4.16. Suppose that a line I contains more than 3d flat points of Z. Then I is a 
flat line of Z. 

Proof. Let x\, x^d+i be flat points of Z contained in I. By Lemma [4. 141 each polynomial 
V ejX VpVp x Vp vanishes at a?j. Since the degree of these polynomials is < 3d, we conclude 
that each of these polynomials vanishes on /. Similarly, p vanishes on I. Therefore, the 
line I lies in Z and every regular point in / is a flat point. But by definition, Xi are regular 
points of Z. Therefore, I is not a critical line, and it must be a flat line. □ 

Using these lemmas, we will prove that a definite fraction of the lines of £ are either critical 
or flat. 

We define £' z to be the set of lines of 2,z that contain at least (l/200)<S7cL _1 points of & z . 
Lemma 4.17. Each line in £! z is either critical or flat. 

Proof. Since every point of & z is either critical or flat, each line in 2! z contains either 
( 1/400) STcL" 1 critical points or (1/400) SfcL -1 flat points. But by Lemma S33 d < 
10 -8 SkL~ l . So by Lemmas 14.151 and 14.161 each line of Sl z is either critical or flat. □ 

Now we show that Z! z contains a definite fraction of the lines of £. 
Lemma 4.18. The number of lines in 2l z is > (1/200)L. 
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Proof. Recall that we assumed in the statement of Proposition 14.71 that there are at least 
(1/100)L lines of £ that each contain > (l/\00)SkL~ 1 points of &. We denote these lines 
by £ C £. 

Suppose a line I lies in £q\£' z . Then / contains at least (l/W0)SkL~ 1 points of 6. But it 
contains less than (l/200)SkL~ 1 points of & z . Therefore, it contains at least (1/200) 5£;L _1 
points of & \ & z . So we get the following inequality. 

{l/mS)SkL- x \&Q \ Sf z \ < 1(6 \ 6' z ,£ \ £z)- 
But since each point of 6 lies in at most 2k lines of £, 

1(6 \ e' z , £ \ £f z ) < \ ©'z, £ ) < 2k \ & \ & 'z\- 

Lemma P4. 131 savs that |6 \ & z \ < lO -7 ^. Assembling all our inequalities, we see that 

(l/200)5A;L- 1 |£o \ £f z \ < 2k • 10" 7 ,S. 
Simplifying this expression, we see that 

|£o\£'z| < 4 - 10" 5 L. 

So almost all the lines of £o lie in £' z . In particular, £' z contains > (1/200)L lines. □ 

Next we bound the number of critical lines in Z. 

Lemma 4.19. A surface Z of degree d contains < d 2 critical lines. 

This lemma follows from Bezout's theorem applied to p and Vp. See Proposition 3 in 
[EKE]. 

If the constant A from inequality 4.1 is sufficiently large, then d 2 will be much less than L. 
We record this calculation in the next lemma. 

Lemma 4.20. If A is sufficiently large, then d < 10 _4 L 1 / 2 . 

Proof. The inequality 4.1 implies that 1 < A~ 1 SL~ 3 / 2 k 2 . Therefore 

d < dA^SL-^k 2 < A- l L l l 2 k-\ 
Choosing A sufficiently large finishes the proof. □ 
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In particular, we see that Z contains at most d 2 < 10~ 8 L critical lines. Since £! z contains 
at least (1/200)L lines, we see that most of these lines must be flat. In particular, 2! z 
contains at least (1/300)L flat lines of Z. 

We are trying to prove that Z contains some planes. Let Z p i denote the union of all planes 
contained in Z. We let Z denote the rest of Z so that Z = Z p iL)Z. In terms of polynomials. 
Z is the vanishing set of p. The polynomial p factors into irreducibles: p = p\P2---- Some 
of these factors have degree 1, and some factors have degree more than 1. Each factor of 
degree 1 defines a plane, and Z p i is the union of these planes. The product of the remaining 
factors is a polynomial p, and Z is the zero-set of p. A line which lies in both Z p i and Z 
is actually a critical line of Z. So a flat line of Z lies either in Z p i or in Z, but not both. 
A flat line of Z that lies in Z is a flat line of Z. The number of flat lines in a surface of 
degree < d is bounded by the following lemma from [EKS . 

Lemma 4.21. ([EKS], Proposition 8) If Z is an algebraic surface of degree < d with no 
planar component, then Z contains < 3d 2 flat lines. 

We have seen that £ contains at least (1/300)L flat lines of Z. But Z contains only 3-10 -8 L 
flat lines. The rest of the flat lines lie in Z p i. In particular, £ contains at least (1/400)L 
lines in Z p \. 

Finally, we observe that the number of planes in Z p i is < d < L 2 S~ 1 k~ 3 . So one of these 
planes must contain > Sk 3 L _1 lines of £. In other words, B > Sk ?J L~ l . 

At several points in the argument, we needed A to be sufficiently large. We now choose 
A large enough for those steps. We conclude that either S < AL 2 k~ 3 / 2 + Lkr 1 or else 
S < LBk~ 3 . This finishes the proof of Proposition 14.71 □ 

Proposition 14. 71 is the heart of the proof of Theorem l4.51 We are going to reduce the general 
case to Proposition 14.71 First we remove the assumption that many lines have roughly the 
average number of points. 

Proposition 4.22. Let k > 3. Let £ be a set of L lines in R 3 with < B lines in any 
plane. Let & be a set of S points so that each point meets between k and 2k lines of £. 

Then S < C[L 3 / 2 k~ 2 + LBk~ 3 + Lk' 1 } . 

Proof. Let £i be the subset of lines in £ which contain > ( 1/100) SkL^ 1 points of (3. If 
|£i| > (1/100)L, then we have all the hypotheses of Proposition 14.71 an d we may conclude 

S < C [L 3/2 £T 2 + LBk- 3 + LAT 1 ]. 

We are going to prove that S obeys this same estimate, with the same constant, regardless 
of the size of £i. The proof will go by induction on the number of lines. 
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From now on we assume that |£j| < (1/100)L. The lines in £1 contribute most of the 
incidences. In particular, we have the following inequality. 

7(6, £ \ £1) < (l/W0)SkL- 1 • L = (1/100)5%. 

We define 6' C 6 to be the set of points with > (9/10) A; incidences with lines of £i. 

If x is in © \ & , then x lies in at least k lines of £, but less than (9/10)A lines of £j. So x 
lies in at least (1/10) A; lines of £ \ £i. Therefore, 

(l/10)fc|6 \ 6'| < 1(6 \ 6', £ \ £i) < 1(6, £ \ £i) < (1/100)5A. 
Rearranging, we see that |6 \ &\ < (1/10)5, and so |6'| > (9/10)5. 

A point of ©' has at least (9/10)A: incidences with £i and at most 2k incidences with £i. 
This is a slightly larger range than we have considered before. In order to do induction, 
we need to reduce the range. We observe ©' = U &_ , where & + consists of points with 
> k incidences to £i and &_ consists of points with < k incidences with £i. We define ©i 
to be the larger of & + and &_. It has > (9/20)5 points in it. 

If we picked ©i = & + then we define k\ = k. If we picked ©i = then we define k± 
to be the smallest integer > (9/10)A. Each point in ©i has at least k\ and at most 2k± 
incidences with lines of £i. Also, k\ is an integer > (9/10) A; > 27/10, so k\ > 3. 

The set of lines £i and the set of points ©i obey all the hypotheses of Theorem l4.22l (using 
k\ in place of k and using the same B). There are fewer lines in £i than in £. Doing 
induction on the number of lines, we may assume that our result holds for these sets. If 
we denote |£i| = L\ and |©i| = 5i, we get 

5i < C [Li /2 Af 2 + BLiki 3 + L{iq x \. 

Now 5 < (20/9)5i. Also, L\ < (1/100)L. And k x > (9/10)A. 
Therefore, 

5< (20/9)5i < [(20/9)(l/100)(10/9) 3 ]C [L 3/2 A- 2 + L J BA- 3 + LA;- 1 ]. 
The bracketed product of fractions is < 1, and so 5 obeys the desired bound. 

□ 

Finally, we can prove Theorem 14. 51 
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Proof of Theorem \4-5\ Let k > 3. Suppose that £ is a set of L lines with < B in any plane. 
Suppose that (3 is a set of points, each intersecting at least k lines of £. 

We subdivide the points © = U^_ 6j, where <5j consists of the points incident to at least 
2^k lines and at most 2^ +1 k lines. We define kj to be 2?k. Then Theorem 14.221 applies to 
(£, <Sj, kj,B), and we conclude that 

\<&j\ < C [L 3 / 2 kJ 2 + LBkJ 3 + Lkj 1 ] 

< 2- j C [L 3/2 k- 2 + LBk~ 3 + LAT 1 ]. 
Now S < J2j \&j\< 2C [L 3 / 2 k- 2 + LBk- 3 + Lk' 1 }. 

□ 

5 Appendix: The example of a square grid 

In this section, we return to Erdos's example of a square grid of points. When P is a 
square grid of N points, we show that \Q(P)\ > N 3 logN and \Gk(P)\ > N 3 k~ 2 for all 
2 < k < iV/2000. So the estimates in Propositions 12.21 and 12.51 are sharp up to constant 
factors. We also study the set of lines £ associated to a square grid P. This set of lines 
shows that many of our incidence estimates are sharp up to constant factors. 

Let S > 1 be an integer. Let P be the grid of points (x, y) where x and y are integers with 
norm < 2S. Note that the number of points in \P\ is N = (2S + l) 2 . Let 2 be the set of 
lines in R 3 associated to the set P, as in Section 2. 

Lemma 5.1. If a,b,c, and d are positive integers with norm < S, then the line from 
(a, b, 0) to (c, d, 1) is contained in £. 

Proof. Using the parametrization in 12.31 we see that the line from (a, b, 0) to (c, d, 1) is the 
line L pq , where p and q are defined by the following equations. 



{l/2)(p x + q x ) = o; {l/2){p y + q y ) = b; (l/2){q y - p y ) = c - a; (l/2)(p x - q x ) = d - b. 

Solving these equations, we get p = (a + d — b, b — c + a) and q = (a — d + b,b + c — a). 
Since a,b,c, and d are positive integers of norm < S, it follows that p x ,p y ,q x , and q y are 
integers of norm < 25, and so p and q lie in P. □ 
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Let £o C £ be the set of lines from (a, b, 0) to (c, d, 1) where a, 6, c, and d are positive 
integers with norm < 5. In the proposition below, we study the incidences of £o- Note 
that |£ | = 5 4 . 

Proposition 5.2. Let be the set of points in R 3 that lie in at least k lines o/£o- For 
any k in the range 2<k< (l/400)5 2 , |6 fc | > S 6 k~ 2 . 

Proof. Consider a point x in R 3 contained in the slab < Xs < 1. We define a map 
F x : R 2 — > R 2 by saying that F x (a,b) = (c,d) if the line from (a, 6,0) through x hits 
(c, d, 1). We define G to be the integral grid in the plane given by {(a, b)} with 1 < a, b < 5. 
The number of lines from £o which pass through x is exactly the cardinality of F X (G) n G. 
Now any intersection of two lines from £o will have rational coordinates, so we can assume 
the coordinates of x are rational. Let us say that the X3 coordinate of x is p/q, written in 
lowest terms. 

By a similar triangles argument, F X (G) is a square grid with spacing ^y-. Since p and q are 
in lowest terms, the intersection F X (G) D G will be a rectangular grid with spacing q — p. 
The edges of this rectangle will have length < 5. So the number of points in F X {G) D G is 
at most S 2 {q — p)~ 2 - On the other hand, the edges of this rectangle have length < S^j^-. 
Therefore, the number of points of F X (G)C\G is at most S 2 p~ 2 . Combining these estimates, 
we see that \F X (G) DG\< 4S 2 q- 2 . 

Let us say that the middle half of G, written G m iddie C G, is the integral grid {(a, b)} 
with (1/4)5 < a,b < (3/4)5. If F x maps a vertex from G m iddie hito G, then the number of 
intersections between F X (G) and G is fairly close to this upper bound. Using the arguments 
from the last paragraph, it's straightforward to show that \F X (G) n G\ > (l/100)5 2 g~ 2 
whenever F x (G mid die) intersects G. 

Let us define X(p, q) to be the set of x = (xi, X2,p/q) so that F x (G m iddie)^G is non-empty. 
The set X(p,q) lies in whenever k < (l/100)5 2 g~ 2 . Equivalently, X(p,q) lies in &k 
whenever q < (l/10)5fc- 1 /2. 

For any pair of points (a±, b{) € G m iddie and («2, 62) € G, there is a unique x € X(p, q) so 
that F x {a\,b\) = (g^)^)- There are ~ 5 4 such pairs of points. Each element of X(p,q) 
corresponds to at least (l/100)5 2 g -2 pairs of points, and at most 45 2 g~ 2 pairs of points. 
Therefore \X(p,q)\ ~ S 2 q 2 . 

Now we fix k < (l/400)5 2 . We pick q in the range (l/20)5fc- 1 /2 < q < (l/l0)5fc- 1 /2. 
Because k is not too big, this range of q contains some integers. For each p coprime to q, 
X(p,q) C &k- The sets X(p,q) are clearly disjoint, and so 

(l/10)5fe- 1 /2 (l/10)S'fc- 1 /2 

\&k\> J2 E \^{p,Q)\> E ^i)sv. 

q=(l/20)Sk- 1 / 2 0<p<q,gcd(p,q)=l g=(l/20)5fe- 1 /2 
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The sums of the Euler totient function 4>{n) are well studied. Theorem 3.7 in [A] gives the 
asymptotic Y2q=i ^(l) = + 0(x\ogx). Therefore, Y^=x ~ x2 - Therefore, 

l©fcl > (Sk- 1 ' 2 ) 2 S 2 q 2 ~S 6 k- 2 . 

□ 

Recall that \Gk(P)\ is at least |G' fc (P)|, which is the number of points lying in at least k 
lines of £. So we see that \G k (P)\ > S & k' 2 ~ N 3 k~ 2 for all 2 < k < (l/400)5 2 < iV/2000. 
The equation 12.21 gives 

N Af/2000 

\Q(P)\^k\G k (P)\> ]T N^-^N 3 log N. 

k=2 k=2 

Now we consider how sharp our incidence theorems are. The set of lines £o C £ has 
< ~ S 2 lines in any plane or doubly ruled surface by Proposition 12.81 This example 
shows that Theorems 12.101 and 12.111 are sharp up to constant factors. 

Next we consider Theorem 14.51 The lines £o correspond to Example 3 in Section 4. The 
three examples show that Theorem l4.5l is sharp up to constant factors as long as B > L 1 / 2 . 
The example £o has B ~ L 1 / 2 . For much smaller values of B, we don't know what happens. 
For example, suppose that £ is a set of L lines in R 3 with at most 100 lines in any plane. 
How many points can be incident to three lines of £? Or suppose that £ is a set of L lines 
in R 3 with at most 100 lines in any plane or doubly ruled surface. How many points can 
be incident to two lines of £? 
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